Epidemic spreading in correlated complex networks 
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We study a dynamical model of epidemic spreading on complex networks in which there are explicit 
correlations among the node's connectivities. For the case of Markovian complex networks, showing 
only correlations between pairs of nodes, we find an epidemic threshold inversely proportional to 
the largest eigenvalue of the connectivity matrix that gives the average number of links that from a 
node with connectivity k go to nodes with connectivity k' . Numerical simulations on a correlated 
growing network model provide support for our conclusions. 
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Statistical physics has witnessed in recent years a re- 
newed interest in graph theory due to the discovery that 
many natural and artificial systems can be described in 
terms of complex networks, in which the nodes represent 
typical units, and the links represent the interactions be- 
tween pairs of units |^. The term complex network 
has been coined to refer to networks that typically exhibit 
two distinct properties: (i) A scale-free (SF) connectivity 
distribution. If we define the connectivity distribution, 
P(fc), as the probability that a node is connected to k 
other nodes, then SF networks are characterized by a 
power-law behavior P{k) ^ where 7 is a charac- 

teristic exponent. This property renders classical mod- 
els of random graphs described by an exponentially 
bounded connectivity, inappropriate for the description 
of many real networks, (ii) The small-world property 
that is defined by an average path length — average dis- 
tance between any pair of nodes — increasing very slowly 
(usually logarithmically) with the network size N. 

Apart from the empirical characterization of real com- 
plex networks and the development of new models, ac- 
counting for the observed properties, the interest in this 
field has been also driven by the discovery of the pro- 
found and subtle effects that the connectivity has on the 
behavior of dynamical systems defined on top of complex 
networks. These effects are particularly interesting in the 
study of disease transmission in SF networks, relevant for 
the understanding of the spreading of computer viruses 
and sexually transmitted diseases ||]. Indeed, it was 
first noted in Ref. that in uncorrelated SF networks 
with a connectivity exponent 7 < 3, epidemic processes 
do not posses an epidemic threshold below which dis- 
eases cannot produce a macroscopic epidemic outbreak 
or the development of an endemic state. This feature, 
observed in several epidemic models |0, ^, 0, is 

deeply rooted in the presence of very large connectivity 
fluctuations in infinite SF networks. 

The study of epidemic spreading in uncorrelated com- 
plex networks (that is, in graphs in which the connec- 
tivity of any node is independent of the connectivity of 
its neighbors) has been proved to be extremely success- 



ful, providing, for instance, the first satisfactory expla- 
nation of the long-standing problem of the generalized 
low prevalence of computer viruses without assuming any 
global tuning of microscopic parameters 0, |l^. Never- 
theless, it represents a first approximation to real net- 
works, that neglects the possibility of correlations in the 
connectivity of the nodes. The existence and importance 
of connectivity correlations has been recently pointed out 
in the literature. In fact, it has become clear that these 
correlations are critical in the understanding of the hi- 
erarchical structure of the Internet |l5| . On the 
other hand, some growing network models have been pro- 
posed [ [l6t , in which correlations are spontaneously gen- 
erated and have important effects in the percolation tran- 
sition. 

In this paper we shall present a study of epidemic 
spreading in complex random networks in which there 
are explicit correlations among the node's connectivities. 
We will consider in particular the subset of undirected 
Markovian random networks, that are completely defined 
by their connectivity distribution P{k) and the condi- 
tional probability P(k' \ k) that a node of connectivity k 
is connected to a node of connectivity k' . These two func- 
tions can have any form (including SF behavior), and are 
assumed to be normafized (^j, P{k) — ^j,, P{k' | fc) = 1) 
and restricted by the connectivity detailed balance con- 
dition 

kP{k' I k)P{k) ^ k'P(k I k')P{k') = (fc) P{k, k'), (1) 

where the symmetric function (2 — 5kk')P{k,k') is the 
joint probability that two nodes of connectivity k and 
k' are connected. The Markovian nature of this class 
of networks implies that all higher order correlations can 
be expressed as a function of P{k) and P{k' \ k), allowing 
an exact treatment of epidemic models at the mean-field 
(MF) level. It is worth noticing, however, that a more de- 
tailed description, in terms of a Langevin equation (to be 
reported elsewhere ||l^), yields exactly the same results, 
confirming the accuracy of the MF description. In this 
framework, the topologically relevant magnitude is the 
connectivity matrix Ckk' = kP{k' \ fc), that measures the 
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average number of links that go from a node with con- 
nectivity k to nodes with connectivity k' . We will show 
that the epidemic threshold is related to the largest eigen- 
value of this matrix. Extensive numerical simulations on 
a random correlated network model [ p6| confirm the pre- 
dictions of the present analysis. During the completion 
of this work we became aware of two recent preprints 
| p8| , in which it is also highlighted the general role 
of correlations in spreading and percolation in complex 
networks. 

In order to study the effects of connectivity correla- 
tions in epidemic spreading, we will focus in the standard 



susceptible- infected-susceptible (SIS) model 20 1. All the 
results, however, can be easily extended to the more 
general susceptible-infected-removed-susceptible (SIRS) 
model 10. In the SIS model each node in the network 
represents an individual, and each link represents a con- 
nection along which the infection can propagate. Sus- 
ceptible (healthy) nodes become infected with probabil- 
ity V if they are connected to one or more infected nodes. 
On the other hand, infected nodes recover spontaneously 
with probability S. The ratio of this two rates defines an 
effective spreading rate \ = v/5 (without lack of general- 
ity, we set 5—1). For homogeneous networks, in which 
each node has more or less the same number of connec- 
tions, A: ~ (fc), a general result states the existence of 
a finite epidemic threshold, separating an infected (en- 
demic) phase, with a finite average density of infected 
individuals, from a healthy phase, in which the infection 
dies out exponentially fast. In terms of the average den- 
sity of infected individuals p{t) (the prevalence) we can 
describe the SIS model in homogeneous networks at a 
MF level by the following rate equation [p| 



dtp{t)^-p{t) + \{k)p{t) [l-p{t)]. 



(2) 



In this equation we have neglected higher order terms, 
since we are interested in the onset of the endemic state, 
close to the point p{t) ^ 0. Also, we have neglected corre- 
lations among nodes. That is, the probability of infection 
of a new node — the second term in Eq. (^) — is propor- 
tional to the infection rate A, to the probability that a 
node is healthy, 1 — pit), and to the probability that a 
link in a healthy node points to an infected node. This 
last quantity, assuming the homogeneous mixing hypoth- 
esis | |2l| , is approximated for homogeneous networks as 
{k) p{t), i.e. proportional to the average number of con- 
nection and to the density of infected individuals, and 
independent of the connectivity. From Eq. (^ it can be 
proved the existence of an epidemic threshold Ac ~ 
pot , such that p = if A < Ac, while p ~ (A - Ac) 
if A > Ac- In this context, it is easy to recognize that 
the SIS model is a generalization of the contact process 
model, widely studied as the paradigmatic example of an 
absorbing-state phase transition to a unique absorbing 
state lEl. 



For general complex networks, in which large connec- 
tivity fluctuations and correlations might be allowed, we 
must relax the homogeneous hypothesis made in writing 
Eq. (H) and work instead with the relative density pk (t) of 
infected nodes with given connectivity fc; i.e. the prob- 
ability that a node with k links is infected. Following 
Refs. ||], the rate equation for pk{t) can be written as 



dpkjt) 
dt 



-Pk{t) + \k[l - pk{t)]Qk{t). 



(3) 



In this case, the creation term is proportional to the 
spreading rate A, the density of healthy sites 1 — Pk{t), 
the connectivity k, and the variable Qk{t), that stands 
for the probability that a link emanating from a node of 
connectivity k points to an infected site. In the case of an 
uncorrelated random network, considered in Refs. ^, 
the probability that a link points to a node of connec- 
tivity k' is independent of the connectivity k of the node 
from which the link is emanating. Therefore, in this case 
Qk = 0"*^ is independent of k and can be written as 



(fc) 



Y,k'P{k')pk'{t), 



(4) 



since the probability that a node is pointing to a node 
of connectivity k' is proportional to k'Pik'). Substitut- 
ing the expression (^ into Eq. (^), one can solve for the 
steady state solution and find the existence of an epi- 
demic threshold Ac, below which there are no solutions 
with a nonzero value of 0'"^. The expression of the epi- 
demic threshold for uncorrelated random networks is 



Ar = 



(5) 



For infinite SF networks with 7 < 3, we have (fc^) = 00, 
and correspondingly A"'^ = 0. Finally, from the solution 
of pk, one can compute the total prevalence p using the 
relation p = P{k)pk- 

For a general network in which the connectivities of the 
nodes are correlated, the above formalism is not correct, 
since we are not considering the effect of the connectivity 
k into the expression for 8^. This effect can be taken into 
account, however, for Markovian networks, whose corre- 
lations are completely defined by the conditional proba- 
bility P{k' \k). In this case, it is easy to realize that the 
correct factor Qk can be written as 



ek{t) = J2Pik'\k)pk'{t), 



(6) 



that is, the probability that a link in a node of connec- 
tivity k is pointing to an infected node is proportional 
to the probability that any link points to a node with 
connectivity /c', times the probability that this node is 
infected, Pk'{t), averaged over all the nodes connected 
to the original node. Eqs. (||) and (||) define together 
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the MF equation describing the SIS model on Markovian 
complex networks, 



dpkjt) 
dt 



-Pk{t) + Afc [1 - Pk{t)] Y,P{k'\ k)pk'{t). (7) 



The exact solution of Eq. (|^) can be difficult to find, 
depending on the particular form of P{k' \ k). However, it 
is possible to extract the value of the epidemic threshold 
by analyzing the stability of the steady-state solutions. 
Of course, the healthy state = is one solution. For 
small pk, we can linearize Eq. (|^), getting 



dt 



Lkk'Pk' (t)- 



(8) 



In the previous equation we have defined the Jacobian 
matrix L = {Lkk' } by 



Lkk' = -Skk' +\kP{k'\k), 



(9) 



where 6kk' is the Kroncckcr delta function. The solution 
Pk = will be unstable if there exists at least one posi- 
tive eigenvalue of the Jacobian matrix L. Let us consider 
the connectivity matrix C, defined by Ckk' = kP{k' \ k). 
Using the symmetry condition Eq. (|l|), it is easy to check 
that if Vk is an eigenvector of C, with eigenvalue A, then 
P(k)vk is an eigenvector of the transposed matrix Cy^ 
with the same eigenvalue. From here it follows immedi- 
ately that all the eigenvalues of C are real. Let A„i be the 
largest eigenvalue of C. Then, the origin will be unsta- 
ble whenever —1-1- AA^ > 0, which defines an epidemic 
threshold 
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(10) 



above which the solution pk — Q '\s unstable, and another 
nonzero solution takes over as the actual steady-state — 
the endemic state. 

It is instructive to see how this general formalism re- 
covers previous results 0, || , implicitly obtained for ran- 
dom uncorrelated networks. For any random network, 
in which there are no correlations among the connectiv- 
ities of the nodes, we have that the connectivity matrix 
is given by 0^, = kP{k'/k) = kk'P{k')/ (fc), since the 
probability that a link points to a node of connectivity 
fc' is proportional to fc'P(fc'). It is easy to check that the 
matrix {C^fk} has unique eigenvalue K'^ = (fc^) /(fc), 
corresponding to the eigenvector w^'^ — fc, from where we 
recover the now established result Eq. (||) . 

In order to check the theoretical prediction Eq. (10), 



we have performed numerical simulations of the network 
model proposed by Callaway et al. |jl^. This model 
is very simply defined: each time step we add a new 
node, and, with probability 6, two nodes are randomly 
selected and joined with a link. This recipe yields a ran- 
dom network with a exponential connectivity distribu- 
tion, P(fc) = (2(5)'''/(l + 2(5)*"'+^, which shows correlations 
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FIG. 1: Total prevalence p for the SIS model in Callaway's 
network and in a MR uncorrelated network with the same 
connectivity distribution. 



among the connectivities of the nodes. These correla- 
tions can be analytically computed by means of the joint 
probability P(fc, fc'), which fulfills the recursion relation 




P(fc,fc') = 



2b 



[P(fc-l,fc')+P(fc,fc'-l)] 



1 + 45 
P(fc - l)P(fc' - 1) 
1 + 45 



Even though it is not possible to solve for P(fc, fc') in 
a closed form, we can numerically estimate the largest 
eigenvalue of the connectivity matrix C by generating a 
finite matrix P(fc, fc') and using the relation (^ to find 
P(fc' I fc). The correctedness of this numerical estimate is 
ensured by the exponential decay of P(fc, fc') for large val- 
ues of fc and fc', as can be seen from the recursion relation. 
The estimate of the largest eigenvalue of the connectiv- 
ity matrix for Callaway's model is A,„ ~ 6.47656, which 
yields an epidemic threshold Ac ~ 0.15. This value is 
to be compared with the prediction for an uncorrelated 
network, A^'^ = (fc) / (fc^) = 0.20. 

Figure |^ shows the results of numerical simulations of 
the SIS model on Callaway's network, as well as on ran- 
dom networks with the same connectivity distribution, 
generated using the MoUoy and Reed (MR) algorithm 
p3| , p4| . The MR algorithm generates a random network 
with a prescribed connectivity distribution and no cor- 
relations among nodes, and thus it is expected to yield 
an epidemic threshold given by A"'^. Simulations were 
performed for a fixed value (5 = 1 in networks of size up 
to = 10^, averaging over at least 100 different start- 
ing configurations, performed on at least 10 different re- 
alizations of the network. Figure |^ depicts the steady 
state prevalence as a function of the spreading rate A. 
For the MR network, the function p(A) shows a clear 
linear behavior. The epidemic threshold estimated from 
a least-squares fitting is Ac = 0.21 + 0.01, in excellent 
agreement with the prediction for an uncorrelated net- 
work. On the other hand, Callaway's network exhibits a 
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ther work is necessary in this direction. 
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FIG. 2; Fit of the prevalence for the SIS model in Callaway's 
network to the form p(A) ~ exp[— a(A — Ac)"^''^]. The fit 
yields the values a = 1.52 ± 0.02 and A^ = 0.11 ± 0.02. 



very different behavior, which might be compatible with 
the presence of a transition of infinite order |2^. In fact, 
Fig. |l| is reminiscent of the behavior found in Ref. jl^ 
for the giant component of the network as a function 
of the parameter S. In that work, the size of the giant 
component was fitted to an stretched exponential with 
exponent 1/2. Guided by this intuition, in Fig. ^ we 
perform a fit of the stationary prevalence in Callaway's 
model to the form p{X) - exp[-Q!(A - Ac)"^/^] ||l|l. The 
fit yields a prefactor a ~ 1.52 ± 0.02, and an epidemic 
threshold Ac — 0.11 ±0.02, smaller by a factor 2 than the 
value corresponding to an uncorrelated random network, 
and in quite good agreement with the prediction from 
the largest eigenvalue of the connectivity matrix. 
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III summary, we have shown that, m the presence 
correlations, the epidemic threshold m complex networks 
is determined by the connectivity matrix C, and not by 
the connectivity distribution P{k), as happens in uncor- 
related networks. This fact implies that the previously 
predicted null epidemic threshold for SF networks with 
7 < 3 might be shifted in correlated graphs, attaining a 
positive value depending on the nature of the correlations 
as given by the connectivity matrix. At this respect, it 
might be surprising that some complex networks, such 
as the Barabasi- Albert (BA) graph are exactly de- 
scribed at the uncorrelated level given by Eq. (||) with 
0fe independent of fc @, This fact must be taken as 
an evidence of the lack of correlations in the BA model; 
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lackl that, on the other hand, has been already checked 



numerically m Kct. [[l4|. ihc tormahsm presented m this 
paper represents a refinement over previous works be- 
cause it includes the effects of correlations between pairs 
of nodes and, in this sense, it is exact for Markovian 
networks. Real networks, such as the Internet, however, 
posses a more complex correlation structure. Our formal- 
ism will provide an improved approximation to epidemic 
dynamics in these cases, but it still remains the task of 
ascertaining the effects of higher order correlations. Fur- 
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